Search Results
Multimodal Pretraining for Dense Video Captioning
Multi-modal Dense Video Captioning (CVPR Workshops 2020)
ActivityNet Event Dense-Captioning
A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer
Lecture 18. Image/Video Captioning
Dense Video Captioning with Semantic Features and Attention
Video Captioning
PLLaVA: Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning
[CVPR 2023] Vid2Seq: 8 Min Presentation
Dense Captioning of Images - Video Demo
GRU-based Automated Video Captioning on Android Mobile Devices - Spring 2021